Applying a Data Miner To Heterogeneous Schema Integration

Authors

  • Son Dao
  • Brad Perry
Abstract

We introduce an application of data mining techniques to heterogeneous database schema integration. We use attribute-oriented induction to mine characteristic and classification rules about individual attributes from heterogeneous databases. Each mining request is conditioned on a subset of attributes identified as "common" between the multiple databases. We develop a method to compare the rules for two or more attributes (from different databases) and use the similarity between the rules as a basis to suggest similarity between the attributes. As a result, we use relationships between and among entire sets of attributes from multiple databases to drive the schema integration process. Our initial efforts and prototypes applying data mining to assist schema integration prove promising and, we feel, identify a fruitful application area for data mining research.

Keywords: schema integration, multi-database interrelationships, attribute similarity, data mining, attribute-oriented induction.
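
The idea of comparing rules mined for attributes from different databases, conditioned on a common attribute, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`characteristic_rules`, `rule_similarity`, `band`), the toy tables, and the choice of histogram overlap as the similarity measure are all assumptions made for illustration, and the single `generalize` function is only a simplified stand-in for the concept hierarchies used in attribute-oriented induction.

```python
from collections import Counter, defaultdict


def characteristic_rules(rows, common, target, generalize=lambda v: v):
    """For each value of the common attribute, return the relative
    frequency of each generalized value of the target attribute.
    (Simplified stand-in for an induced characteristic rule.)"""
    counts = defaultdict(Counter)
    for row in rows:
        counts[row[common]][generalize(row[target])] += 1
    return {
        key: {val: n / sum(c.values()) for val, n in c.items()}
        for key, c in counts.items()
    }


def rule_similarity(rules_a, rules_b):
    """Average histogram overlap over the common-attribute values that
    both rule sets share; the result lies in [0, 1]. The overlap measure
    is an assumption, not the measure from the paper."""
    shared = rules_a.keys() & rules_b.keys()
    if not shared:
        return 0.0
    total = 0.0
    for key in shared:
        da, db = rules_a[key], rules_b[key]
        total += sum(min(da.get(v, 0.0), db.get(v, 0.0))
                     for v in da.keys() | db.keys())
    return total / len(shared)


def band(x):
    """Hypothetical one-level generalization of a numeric value."""
    return "low" if x < 50 else "high"


# Two toy databases that share the "common" attribute `dept`.
db1 = [
    {"dept": "eng", "salary": 80},
    {"dept": "eng", "salary": 90},
    {"dept": "ops", "salary": 30},
    {"dept": "ops", "salary": 40},
]
db2 = [
    {"dept": "eng", "pay": 85, "age": 30},
    {"dept": "eng", "pay": 95, "age": 60},
    {"dept": "ops", "pay": 35, "age": 25},
    {"dept": "ops", "pay": 45, "age": 55},
]

salary_rules = characteristic_rules(db1, "dept", "salary", band)
pay_rules = characteristic_rules(db2, "dept", "pay", band)
age_rules = characteristic_rules(db2, "dept", "age", band)

sim_salary_pay = rule_similarity(salary_rules, pay_rules)
sim_salary_age = rule_similarity(salary_rules, age_rules)
print(sim_salary_pay, sim_salary_age)
```

In this toy data, `salary` and `pay` yield matching rules per department and so score higher than `salary` and `age`, which is the kind of signal that could suggest a candidate attribute correspondence for integration.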

Similar resources

OWL as Yet Another Data Model to be Integrated

The paper argues against the opinion, cultivated in the ontological community, that ontologies are at the "semantic" level whereas database schemas are models of data at the "logical" or "physical" level. The paper claims that it would rather be right to consider OWL as yet another data model to be integrated with other heterogeneous information models. Applying the SYNTHESIS – an extensible ...

Investigating a heterogeneous data integration approach for data warehousing

Data warehouses integrate data from remote, heterogeneous, autonomous data sources into a materialised central database. The heterogeneity of these data sources has two aspects, data expressed in different data models, called model heterogeneity, and data expressed within different schemas of the same data model, called schema heterogeneity. AutoMed is an approach to heterogeneous data transfor...

Schema Evolution in Data Warehousing Environments - A Schema Transformation-Based Approach

In heterogeneous data warehousing environments, autonomous data sources are integrated into a materialised integrated database. The schemas of the data sources and the integrated database may be expressed in different modelling languages. It is possible for either the data source schemas or the warehouse schema to evolve. This evolution may include evolution of the schema, or evolution of the m...

Integration of Heterogeneous Object

In a heterogeneous database system which consists of object databases, a global schema created by integrating schemas of the component databases can provide a uniform interface and high level location transparency for the users to retrieve data. The main problem for constructing a global schema is to resolve conflicts among component schemas. In this paper, we define corresponding assertions for ...

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Processing (OLAP), data mining, the semantic web [2] and schema integration. The task is to find the semantic correspondences between elements of two schemas. Recently, schema matching has attracted considerable interest in both research and practice. In this paper, we present a new impr...

Publication date: 1995